camera node
Resource-Efficient Multiview Perception: Integrating Semantic Masking with Masked Autoencoders
Dakic, Kosta, Thilakarathna, Kanchana, Calheiros, Rodrigo N., Lim, Teng Joon
Multiview systems have become a key technology in modern computer vision, offering advanced capabilities in scene understanding and analysis. However, these systems face critical challenges in bandwidth limitations and computational constraints, particularly for resource-limited camera nodes like drones. This paper presents a novel approach for communication-efficient distributed multiview detection and tracking using masked autoencoders (MAEs). We introduce a semantic-guided masking strategy that leverages pre-trained segmentation models and a tunable power function to prioritize informative image regions. This approach, combined with an MAE, reduces communication overhead while preserving essential visual information. We evaluate our method on both virtual and real-world multiview datasets, demonstrating comparable performance in terms of detection and tracking performance metrics compared to state-of-the-art techniques, even at high masking ratios. Our selective masking algorithm outperforms random masking, maintaining higher accuracy and precision as the masking ratio increases. Furthermore, our approach achieves a significant reduction in transmission data volume compared to baseline methods, thereby balancing multiview tracking performance with communication efficiency.
Integrating AI into CCTV Systems: A Comprehensive Evaluation of Smart Video Surveillance in Community Space
Yao, Shanle, Ardabili, Babak Rahimi, Pazho, Armin Danesh, Noghre, Ghazal Alinezhad, Neff, Christopher, Tabkhi, Hamed
This article presents an AI-enabled Smart Video Surveillance (SVS) designed to enhance safety in community spaces such as educational and recreational areas, and small businesses. The proposed system innovatively integrates with existing CCTV and wired camera networks, simplifying its adoption across various community cases to leverage recent AI advancements. Our SVS system, focusing on privacy, uses metadata instead of pixel data for activity recognition, aligning with ethical standards. It features cloud-based infrastructure and a mobile app for real-time, privacy-conscious alerts in communities. This article notably pioneers a comprehensive real-world evaluation of the SVS system, covering AI-driven visual processing, statistical analysis, database management, cloud communication, and user notifications. It's also the first to assess an end-to-end anomaly detection system's performance, vital for identifying potential public safety incidents. For our evaluation, we implemented the system in a community college, serving as an ideal model to exemplify the proposed system's capabilities. Our findings in this setting demonstrate the system's robustness, with throughput, latency, and scalability effectively managing 16 CCTV cameras. The system maintained a consistent 16.5 frames per second (FPS) over a 21-hour operation. The average end-to-end latency for detecting behavioral anomalies and alerting users was 26.76 seconds.
Heteroskedastic Geospatial Tracking with Distributed Camera Networks
Samplawski, Colin, Fang, Shiwei, Wang, Ziqi, Ganesan, Deepak, Srivastava, Mani, Marlin, Benjamin M.
Visual object tracking has seen significant progress in recent years. However, the vast majority of this work focuses on tracking objects within the image plane of a single camera and ignores the uncertainty associated with predicted object locations. In this work, we focus on the geospatial object tracking problem using data from a distributed camera network. The goal is to predict an object's track in geospatial coordinates along with uncertainty over the object's location while respecting communication constraints that prohibit centralizing raw image data. We present a novel single-object geospatial tracking data set that includes high-accuracy ground truth object locations and video data from a network of four cameras. We present a modeling framework for addressing this task including a novel backbone model and explore how uncertainty calibration and fine-tuning through a differentiable tracker affect performance.
Predicting Topological Maps for Visual Navigation in Unexplored Environments
Zhan, Huangying, Rezatofighi, Hamid, Reid, Ian
We propose a robotic learning system for autonomous exploration and navigation in unexplored environments. We are motivated by the idea that even an unseen environment may be familiar from previous experiences in similar environments. The core of our method, therefore, is a process for building, predicting, and using probabilistic layout graphs for assisting goal-based visual navigation. We describe a navigation system that uses the layout predictions to satisfy high-level goals (e.g. "go to the kitchen") more rapidly and accurately than the prior art. Our proposed navigation framework comprises three stages: (1) Perception and Mapping: building a multi-level 3D scene graph; (2) Prediction: predicting probabilistic 3D scene graph for the unexplored environment; (3) Navigation: assisting navigation with the graphs. We test our framework in Matterport3D and show more success and efficient navigation in unseen environments.
Intro to ROS -- ROS Tutorials 0.5.2 documentation
Let's look at the ROS system from a very high level view. No need to worry how any of the following works, we will cover that later. ROS starts with the ROS Master. The Master allows all other ROS pieces of software (Nodes) to find and talk to each other. That way, we do not have to ever specifically state "Send this sensor data to that computer at 127.0.0.1. We can simply tell Node 1 to send messages to Node 2. How do Nodes do this?
Self-driving RC car using Robotic Operating System(ROS)
During my childhood, I dreamed about cars drives itself and never thought world would achieve such a great technology. Nowadays, lots of companies are working hard to bring the self driving cars into realities. I was very interested to learn about the technology, but wasn't sure where to get started. Finally, I decided to join Udacity's Self-driving Car Engineer course to know the technology in depth. In order to get the practical exposure to the projects did for first term, started building self driving technologies in remote controlled car using Robotic Operating System(ROS).